Enhancing the robustness of Bayesian methods for text-independent automatic speaker verification
نویسندگان
چکیده
In this paper we present the main advances of the IRISA speech group from 2001 to 2004 in robust methods for Bayesian adaptation of speaker models and Bayesian decision. The probabilistic framework and the state-of-the-art Bayesian approach for automatic speaker verification are first recalled. We then describe two original contributions for robust Bayesian decision. The first one is a score normalization technique whose main advantage is that it does not need any external data as opposed to other score normalizations. The second technique is a constrained Bayesian adaptation scheme which operates a normalization of the speaker models in order to compensate for speakerdependent biases in the verification scores. Experiments using these two methods showed significant improvements over the baseline systems. Finally, theoretical developments of a hierarchical Bayesian adaptation scheme based on a dependency tree structure is presented, with preliminary experiment results.
منابع مشابه
Text Independent Speaker Modeling and Identification Based On MFCC Features
In this gives an overview of automatic speaker recognition technology, with an emphasis on textindependent recognition. Speaker recognition has been studied actively for several decades. We give an overview of both the classical and the state-of-the-art methods. We start with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling. Here, describe a ...
متن کاملComparison of background normalization methods for text-independent speaker verification
This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models. We compare speaker-dependent background speaker sets to the use of a universal, speaker-independent background model (UBM). For the UBM, we describe how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leadi...
متن کاملSpeaker characterization using principal component analysis and wavelet transform for speaker verification
In this paper, we investigate the use of the Wavelet Transform for text-dependent and text-independent Speaker Verification tasks. We have introduced a Principal Component Analysis based wavelet transform to perform frequencies segmentation with levels decomposition. A speaker dependent library tree has been built, corresponding to the best structure for a given speaker. The constructed tree is...
متن کاملImproving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities
In this paper we present a fusion methodology for combining prompted text-dependent and text-independent speaker verification operation modalities. The fusion is performed in score level extracted from GMM-UBM single mode speaker verification engines using several machine learning algorithms for classification. In order to improve the performance we apply clustering of the score-based data befo...
متن کاملStatistical methods and Bayesian interpretation of evidence in forensic automatic speaker recognition
The goal of this paper is to establish a robust methodology for forensic automatic speaker recognition (FASR) based on sound statistical and probabilistic methods, and validated using databases recorded in real-life conditions. The interpretation of recorded speech as evidence in the forensic context presents particular challenges. The means proposed for dealing with them is through Bayesian in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004